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PROTEIN STRUCTURES AND PROTEIN FIBRES 



This invention relates to protein structures, to methods of producing those protein structures, 
and to protein fibres and other materials and assemblies produced using those protein 
structures. 

The process of molecular self-assembly is central to all biological systems and is assuming 
increasing importance and application in biotechnology (L. Q. Gu, et al (1999) Nature 398, 
686) and nanotechnology (K. E. Drexler, (1999) TIBTECH 17, 5). The characterization of 
natural biomolecular assemblies motivates and directs the development of model 
self-assembling systems and, in turn, these advance our understanding of biology. For 
proteins at least, the coiled coil is arguably the simplest self-assembling system. Coiled coils 
are protein- folding motifs that direct and cement a wide variety of protein-protein interactions 
(A. Lupas, (1996) Trends Biochem. Sci 21, 375). In structural terms, coiled coils are 
relatively straightforward: they are a-helical bundles with between 2 and 5 strands that can be 
arranged in parallel, antiparallel or mixed topologies. The basic sequence features that guide 
the formation of coiled coils from peptides are reasonably well understood (P. B. Harbury et 
al (1993) Science 262, 1401; D. N. Woolfson and T. Alber (1995) Protein Sci. 4, 1596). For 
instance, most coiled-coil sequences are dominated by a 7-residue repeat of hydrophobic (H) 
and polar (P) residues, (HPPHPPP)*, known as the "heptad repeat". When configured into an 
ct-helix this pattern gives an amphipathic structure, the hydrophobic face of which directs 
oligomer-assembly. Furthermore, both the number and the direction of chains within a 
coiled-coil bundle is determined predominantly by residues that form or flank the hydrophobic 
core namely, residues at the first, fourth, fifth and seventh positions of the heptad repeat. For 
instance, coiled coils which form dimers (i.e. two-stranded assemblies) usually have 
isoleucine or valine residues at the first position and a leucine residue at the fourth position. 
By contrast, coiled coils that form trimers (i.e. three-stranded assemblies) often have the same 
residues (i.e both isoleucine or both leucine) at both "H" positions. Finally, hetero-oligomers 
(that is coiled coils made from strands with different amino-acid sequences) may be directed 
by complementary charged interactions that flank the hydrophobic core. For these reasons, 
there have been a number of successful de novo protein designs based on the coiled coil 
These include some ambitious structures that extend the natural repertoire of coiled-coil 
motifs (S. Nautiyal et al (1995) Biochemistry 34, 11645; A. Lombardi et al (1996) 
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Biopolymers 40, 495; D. H. Lee et al (1996) Nature 382, 525; P. B. Harbury et al (1998) 
Science 282, 1462; J. P. Schneider et al (1998) Folding Des. 3, R29). 

In addition to commonly accepted structures with a single, contiguous heptad repeat, the 
inventors have identified sequences with multiple, offset heptad repeats which help explain 
oligomer-state specification in coiled coils. For example, sequences with two heptad repeats 
offset by two residues; i.e a/f-b/g'-c/a'-d/b'-e/c'-f/d'-g/e' set up two hydrophobic seams on 
opposite sides of the helix formed. Such helices may combine to bury these hydrophobic 
surfaces in two different ways and form two distinct structures: open "a-sheets" and closed 
"cc-cylinders". 

Other relevant aspects of coiled-coil structure are described in W099/1 1774, the disclosure of 
which is incorporated herein byway of reference. 

This understanding of coiled coils, and the resulting protein designs, centres on short 
structures as exemplified by the leucine-zipper motifs (E. K. O'Shea et al (1989) Science 243, 
538; E. K. O'Shea et al (1991) Science 254, 539), which are found in a variety of transcription 
factors. In contrast, most natural coiled coils extend over hundreds of amino acids (A. Lupas 
(1996) supra; J. Sodek et al (1972) Proc. Natl. Acad. Sci. U.S.A 69, 3800) and many 
assemble further to form thicker, multi-stranded filaments (H. Herrmann and U. Aebi (1998) 
Curr. Opin. Struct. Biol. 8, 177). 

With the goal of making elongated structures to improve our understanding of coiled coils, 
and to develop protein-design studies, we initially designed two 28-residue peptides — 
dubbed Self-Assembling Fibre peptides, SAF-pl and SAF-p2 — to fold and form extended 
fibres when mixed. Focusing on the buried, hydrophobic-core positions of the structure, rules 
were incorporated to direct parallel dimer formation and to guard against alternative 
oligomers and topologies (P. B. Harbury et al (1993) supra; D. N. Woolfson and T. Alber 
(1995) supra; L. J. Gonzalez et al (1996) Nature Struct. Biol. 3, 1011). The building block of 
the design was a staggered heterodimer with overhanging or "sticky" ends. This contrasts 
with and distinguishes it from the natural and designer coiled-coil assemblies that have been 
characterized to date, in which the polypeptide strands align in-register, i.e they have blunt or 
"flush" ends. Complementary core interactions and flanking ion-pairs were incorporated into 
the overhangs to facilitate longitudinal association of the heterodimers (Figs. 1&2). This 
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principle of using "sticky ends'* is well developed in molecular biology for assembling DNA 
(S. J. Palmer et al (1998) Nucleic Acids Res. 26, 2560), and has been used to design intricate 
DNA crystals (E. Winfree et al (1998) Nature 394, 539). However, to our knowledge, our 
application of sticky end-directed molecular assembly to peptides is new; although we do note 
that head-to-tail packing of helices has been observed in recently solved crystal structures for 
two designer peptides (N. L. Ogihara et al (1997) Proteii\ Sci. 6, 80; G. G. Prive et al (1999) 
Protein Sci. 8, 1400). These were helical peptides that crystallised with their helical ends in 
contact so as to form pseudo-continuous helices in the solid state. In other words they formed 
"blunt-ended" arrangements. 

According to one aspect of the invention there is provided a protein structure comprising a 
plurality of first peptide monomer units arranged in a first strand and a plurality of second 
peptide monomer units arranged in a second strand, the strands preferably forming a coiled- 
coil structure, and in which a first peptide monomer unit in the first strand extends beyond a 
corresponding second peptide monomer unit in the second strand in the direction of the 
strands. The protein structures of the invention have numerous advantages. For example, 
relatively long protein fibres can be formed with little material - 1 jil of a 100 \xM solution of 
the peptide monomers may provide enough material to form 10 m of fibre 50 nm thick. 

At least one charged amino acid residue of the first peptide monomer unit may be arranged to 
attract an oppositely-charged amino acid residue of the second peptide monomer unit. 
Preferably, the charged amino acid residue is in an end portion of the first peptide monomer 
unit, which extends beyond the corresponding second peptide monomer unit in the second 
strand. At least one strand may consist solely of first or second peptide monomer units 
respectively i.e homogenous strands. Heterologous strands are also contemplated. The 
peptide monomer units may comprise a repeating structural unit. Preferably, the repeating 
structural unit comprises a heptad repeat motif, having the pattern: 



Preferably, the repeat may include isoleucine or asparagine at position a and leucine at 
position d. Other repeats (e.g hendecads - abcdefghijk) and amino acid compositions may 
also be used (see WQ99/1 1774). 



hpphppp 
abcdef g 
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Preferably, the heptad repeat comprises oppositely-charged residues at positions e and g 
respectively. The oppositely-charged residues may be, for example, glutamic acid and lysine 
resumes or arginme and aspartic acid. The use of synthetic ammo adds, such as ornithine is 
also envisaged. 

A protein structure in accordance with the invention may be also specified by pairs of 
asparagine residues in the "a" positions provided by corresponding first and second peptide 
monomer units. 

In a preferred protein structure, the first and second peptide monomer units have the following 
sequences: 

a) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-plQand 

b) KIRALKAKNAHLKQEIAALEQEIAALEQ (SAF-p2D) respectively; 
C) KIAALKQKIAALKQEIDALEYENDALEQ (SAF-plA) and 

d) KI RALKWKNAHLKQE I AALEQE I AALEQ ( SAF-p2C) respectively; 

e) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-plQand 

f) KIRALKWKNAHLKQEIAALEQEIAALEQ (SAF-p2C) respectively. 

It will be appreciated that these are examples only of 4-he P tad structures and that other 
lengths are possible and envisaged for use in the invention. 

According to another aspect of the invention, there is provided a method of producing protein 
structures, the method comprising providing a mixture of first and second peptide monomer 
units which associate to form a protein structure according to the invention. The structure can 
be denvahsed and/or stabilized by cross-linking. 

Derivatization of the peptide monomer units before or after assembly into the protein 
structures of the invention may be performed. For example, fluorescent moieties 
(fluorophores) may be attached to the coiled coil as described in W099/1 1774. The addition 
of fluorescent moieties may assist visualization of the protein structure. Substitution with 



or 
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functional groups at the "f" position in the heptad repeat is especially preferred as that 
position is on the outside of the helix (see Fig. 1C and IE). Other derivatives may include 
attaching binders to the peptide monomer units for example so that units which can bind other 
entities can be produced. 

The first and second peptide monomers and the strands may have the characteristics described 
above. 

The invention also provides protein fibres produced by an association of protein structures 
according to the invention. 

The protein structures may also be arranged to 'form tubular structures. In particular, the 
structures may be arranged to form nanotubes. 

According to another aspect of the invention, there is provided a kit for making protein 
structures, the kit comprising first and second peptide monomer units which associate to form 
a protein structure or protein fibres according to the invention. 

The protein structures of the invention may be assembled in two and three dimensional arrays. 
For example, two dimensional mats can be formed which can function, for example as filters. 
Three dimensional grids or matrices can also be formed again, for example, for use as sieves 
or filters or for organising other associated or conjugated molecules in three dimensions. 

In a preferred embodiment, a matrix is assembled in situ. For example, a matrix can be 
formed in a solution to entrap contaminants in the solution and then the matrix, together with 
contaminants, can be removed from the solution for example by centrifugation. 

The stability of the protein structures at higher temperatures may be improved by making the 
peptide monomers longer, such that the overlap between corresponding first and second 
monomer unit residues is increased. Increases in monomer length have previously been 
shown to stabilize coiled coil structures. Alternatively, stability can be improved by 
introducing bonding between adjacent peptide monomer units in the same strand. For 
example, Kent (Dawson et al (1994) Science 266: 776) and co-workers have produced peptide 
bonds between adjacent polypeptide units by coupling and subsequent rearrangement of a 
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cysteine residue at the N end of one polypeptide unit to a thio-ester dSatised C-terminus of 
another unit. 

Additionally, the protein structures may be stabilised and derivatised by using them to 
template the polymerisation of synthetic polymers. 

Definitions 

The terms used in the specification are to be given the ordinary meaning attributed to them by 
the skilled addressee. The following is given by way of clarification: 

Amino acid. 

This term embraces both naturally-occuring amino acids and synthetic amino acids as well as 
naturally-occuring amino acids which have been modified in some way to alter certain 
properties such as charge. In all cases references to naturally-occurring amino acids may be 
considered to include synthetic amino acids which may be substituted therefor. 

Coiled Coil 

A coiled-coil is a peptide/protein sequence usually with a contiguous pattern of hydrophobic 
residues spaced 3 and 4 residues apart, which assembles (folds) to form a multi-meric bundle 
of helices. Coiled-coils including sequences with multiple offset repeats are also 
contemplated. 

Dimer 

A dimer is a two stranded structure. 
Heterodimer 

A heterodimer is a dimeric structure formed by two different stands. 
Staggered heterodimer 

A staggered heterodimer is a structure in which the two strands assemble to leave overlapping 
ends that are not interacting within the heterodimer. 
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Blunt-end assembly is association where the two strands combine to give flushed i.e 
non-overlapping ends. 

Protofibril 

A protofibril is a protein structure assembled longitudinally from staggered heterodimers 
interacting through their overhanging ends. 

Fibre 

A fibre is a structure formed by lateral association of two or more protofibrils. 

Protein structures and methods of producing protein structures in accordance with the 
invention will now be described, by way of example only, with reference to the accompanying 
Figures 1 to 8 in which: 

Fig. 1 illustrates the design and the sequences of self-assembling fibre (SAF) peptide 
monomers of the invention. 

Fig. 2 illustrates computer modelling of the designed self- assembling fibre of the invention. 

Fig. 3 illustrates the results of circular dichroism (CD) and linear dichroism (LD) experiments 
on protein structures of the invention. 

Fig. 4 illustrates the assembly of synthetic protein fibres visualized directly by transmission 
electron microscopy and an analysis of fibre width In all panels, the white scale bars represent 
100 nm. Fig. 4D is a histogram showing the distribution of fibre widths determined using 
TEM for fresh (white bars) and matured (black bars) mixtures of SAF peptides at 100 nM (a 
width value of "x" on the histogram includes all measurements made from "(x-5) to x"). 

Fig. 5 is a cartoon showing the possible anti-typic association of parallel helical peptides 
leading to a homo-oligomeric peptide nanotube. 

Fig. 6 is an x-ray diffraction pattern of an aligned protein fibre of the invention. 



0121646A1J > 
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Fig. 7 is an electromicrograph showing fibres which have been derivatised through the 
inclusion of fiurophores; and 

Fig. 8 shows amino acid sequences designed to form blunt-ended heterodimers. 
1) Peptide Design and Synthesis 

Various peptide monomer units were designed as described above. The monomers and 
capping peptides (designed to complement the sticky ends of the monomers so as to produce 
flush, or blunt ends and, so, arrest longitudinal fibre assembly) are set out in Table 1 : 
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Fig. 1 shows (A) A mechanism for self-assembly: complementaiyfharges in "companion" 
peptides direct the formation of staggered, parallel heterodimers; the resulting "sticky" ends 
are also complementary and promote longitudinal association into extended structures. Fig 
1(B) shows the designed amino acid sequences: each peptide comprised canonical heptad 
repeats (abcdefg) with lie at a and Leu at d to guide the formation of coiled-coil dimers; 
oppositely-charged residues were incorporated at e and g to favour the staggered dimer with 
sticky ends; asparagine residues (which preferentially pairs with each other at a sites 
(Gonzalez L et al (1996) Nature Structural Biology 3, 13: 1011-1018) were included to 
cement the prescribed register further and to favour the parallel structures. Fig. 1(C) is a 
helical-wheel representation, summarizing the designed sequences in context. The view is 
from the N-terminus with heptad sites labeled a-g and assumes 3.5 residues per helical turn to 
emphasise the heptad repeat. 

The peptides were synthesized on an Applied Biosystems 432A Peptide Synthesizer using 
solid-phase methods and Fmoc chemistry. Peptide samples were purified using 
reversed-phase HPLC and their identities confirmed by MALDI-TOF mass spectrometry. 

Various combinations of peptide monomers and capping peptides were tested as set out in 
Table 2: 



SUBSTITUTE SHEET (RULE 26) 



WO 01/21646 



11 



PCT/GB00/03576 



CO 

< 



< o 

S © 



° ^~ 

U3 — S 



o <^ 

< - 



(SI =M 



a. 



3 



in c 



£ ^ O — 
.S c s c f= 
cs c — «* 

Si *<§2 



Si 

o £ 



t-i r- O 



£ E «* E 

g e g e 



o 



= P « 2 »• 



3 o 



i H 



lelix 


O 

a 




on) 


0> 


a 


36/46 


oc 

CJ 


Icracti 


a 

s£ 


\o 


u 


c 

X 


no in 


o 

CO 

I 



.2 p 



o 

no 



OC - 



CJ 



^ ^ 3 g H 
v-> ^ >f o £ 



I 



{J C 3 w 

* 5 = s 

u-> O ^-o (J 

OO (N ^ U 



3 













2 £ 


Ob 

o o 


o i 

o o 


o | 
o o 


~" Ofl 




*~ 00 


<§>.£ 

il 


@).E 

u 


<D.E 

il 




(A U 





a. 
< 

<J 



< 



< 


< 


C2 


C 


CJ 


CJ 


a 


CM 






rsi 










c 


C_ 


c_ 


c 


a. 


c 
















< 


< 


< 


< 


< 


< 


< 


CO 




CO 


CO 


co 


CO 


CO 
















< 


< 


< 


a 


< 


CJ 


a 
















& 


D. 


a. 


c 


c 


c 


c 












iL 




< 


< 


< 


< 


< 


< 


< 


oo 


CO 


co 


CO 


CO 


CO 


CO 



•I 2 S2 



-a 

i 



SUBSTITUTE SHEET (RULE 26) 



WO 01/21646 PCT/GBOO/03576 

In addition and as a control, the SAF-plc sequence was permuted (N- and C -terminal halves 
were swapped) to produce peptide SAF-p3: 



E I DALE YE NDALEQK IAALKQK IASLKQ 

This design should combine with S AF-p2D to form a blunt-ended structure, which should not 
form fibres. 

2) Modeling of Protein Fibre Structure 

A model of the three-dimensional structure of the designed protein fibre resulting from the 
assembly of SAF-pl and SAF-p2 was made from the minimised structure of a model 
coiled-coil 35-mer, (LAALAAA) 5 , which was generated using Crick's Equation and had an 
ideally packed interface (G. Offer and R. Sessions, J. Mol Biol. 249, 967 (1995)). Copies of 
the 35-mer were superimposed with an overlap of one heptad repeat to extend the structural 
template, and the backbone was rejoined after removal of overlapping segments. Residues in 
the two-stranded template were replaced with the sequences of the SAF peptides, staggered 
relative to each other by two heptad repeats according to the alignment in Fig. IB. The 
structure was soaked in a 5 A layer of water and energy minimised until the average absolute 
derivative of coordinates with respect to energy fell below 0.01 kcal A -1 . The structure was 
built and visualized using Insight II 97.0 (Molecular Simulations Inc.), and was 
energy-minimized using Discover 2.9.8 (Molecular Simulations Inc.) with the consistent 
valence forcefield. In Fig 2(A) peptides SAF-pl and SAF-p2 (each coloured dark 
grey-to-light grey from the N-terminus) interact through core residues including asparagine 
pairs (coloured mid-grey) to form the two strands of a staggered, parallel, coiled-coil fibre. In 
Fig. 2(B), negatively charged glutamate side chains (coloured light grey) and positively 
charged lysine side chains (coloured black) form complementary charge interactions between 
the SAF peptides. 



3) Circular Dichroism Experiments 

Peptide samples were incubated at 5°C in 10 mM MOPS (3-(N-Morpholino)propanesulfonic 
acid), pH 7. Sample concentrations were determined from their UV absorbance at 280 ran 
(SAF-pl) and 214 nm (SAF-p2). After baseline correction, ellipticities in mdeg were 
converted to molar ellipticities (deg cm 2 dmol-res" 1 ) by normalizing for the concentration of 
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peptide bonds. Data were recorded in a cell of 1 mm path length by integrating the signal for 
5s (and Is for the fresh 100 peptide mixture) every nm in the range 205-260 run. CD 
measurements were made using a JASCO J-715 spectropolarimeter fitted with a Peltier 
temperature controller. . 

The CD data shown in Fig. 3 provides spectroscopic evidence for the formation of helical 
structures by the SAF peptides. Fig. 3(A) shows circular dichroism (CD) spectra at 10 jiM 
for: SAF-pl ( — ), SAF-p2 (- -), the average of these spectra (- - -), and the experimental SAF 
peptide mixture (O). Fig 3(B) shows CD spectra at 100 ^M - the key is the same as for Fig 
3(A), but with the additional spectrum (•) being for the SAF peptide mixture after 
"maturation" for 1 h. 

Consistent with our design, neither SAF-pl nor SAF-p2 was highly structured in aqueous 
solution at pH 7 and 5 °C (Fig. 3). However, when mixed in equal proportions the circular 
dichroism (CD) spectrum changed and, moreover, was markedly different from the 
theoretical spectrum generated by averaging the spectra for the isolated peptides. In 
particular, the spectrum for the mixture had intense minima at 208 and 222 nm consistent 
with the formation of a-helical structure, but these features were not as pronounced in the 
spectra of the individual peptides. This was clear evidence that the two peptides interacted to 
form an a-helical structure as designed. Furthermore, and as expected for a multimerization 
event, the magnitude of these spectral changes depended on peptide concentration; a SAF 
mixture with 10 p,M of each peptide, did show a weak signal indicative of some a-helical 
structure, however, a 100 \xM mixture gave a much stronger signal (Figs. 3A&B). 

The shape and intensity of spectra from 100 jiM mixtures of the SAF peptides also changed 
with time (Fig. 3B). Spectra recorded immediately after mixing a "fresh" sample displayed 
some a-helical structure. After incubation of the mixture for 1 hour at 5 °C ("maturation"), 
however, the signal at 222 nm was more intense, and indicated approximately 75 % a-helix, 
consistent with substantial coiled-coil formation. 

Maturation of 100 jiM SAF peptide mixtures was also accompanied by slight clouding of the 
samples. Scattering effects from such samples can lead to attenuation and distortion of CD 
spectra (D. Mao and B. A. Wallace, (1984) Biochemistiy 23, 2667). However, we could 
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disregard this possibility because altering the distance between the s^fle and the detector in 
the CD instrument did not affect the shape or the intensity of the spectrum. Furthermore, we 
established that the majority of the CD signal from the mixtures derived from the suspended 
material: a supernatant without the suspended material, which was recovered by 
centrifiigation of a matured 100 uM SAP mixture, gave only a weak CD signal similar to the 
1 0 uM mixture. 

Thus, the CD data were wholly consistent with the desired a-helical SAP design and, 
moreover, indicated the formation of large assemblies. 

As a control, SAF-p3 (the permutation of SAP-pl (identical to SAF-plc)) was designed to 
form a blunt-ended heterodimer with SAP-pl that should not assemble further into fibres. 
100 uM mixtures of SAP-p2 (identical to SAF-p2D) and SAP-p3 were analysed by 
sedimentation equilibrium in the analytical ultracentrifuge. The resulting data were best 
fitted assuming a single ideal species in solution, and the molecular weight was allowed to 
vary during the fit. An M r of 6422 (with 95% confidence limits of 5924 and 691 1) was 
obtained, which is very close to the expected heterodimer value of 6303 calculated from mass 
spectrometry of the individual peptides. CD spectra for 100 uM fibre-producing mixtures 
(SAP-pl with SAF-p2), and for blunt dimer-producing mixtures (SAF-p2 with SAP-p3), were 
recorded. For the blunt dimer-producing mixtures, the shape and intensity of the CD 
spectrum were fully consistent with coiled-coil formation as designed. In contrast to the 
fibre-producing mixtures, the blunt dimer-producing mixtures showed no signs of maturation; 
that is, negligible spectral changes and no clouding of solutions occurred upon incubation. 
Interestingly, the intensity of the minimum near 222 nm, which is an accepted indicator of 
a-helical structure and degree of a-helical folding, was similar for both mixtures. This 
strongly supports the formation of a-helical structure as designed in the fibre-producing 
mixtures despite the spectral shifts observed upon maturation. 

4) Linear Dichroism Experiments 

Linear dichroism (LD) spectroscopy was also used to test if elongated structures were being 
formed as designed. Long polymers such as DNA molecules can be oriented by shear flow. 
This effect can be monitored by LD spectroscopy provided that chromophores also become 



SUBSTITUTE SHEET (RULE 26) 



WO 01/21646 



PCT/GB00/03576 




15 



aligned by the flow (MTBloemendal (1994) Chem. Soc. Rev. 23, 265; A. Rodger and B. 
Norden (1997) Oxford Chemistry Masters (Oxford University Press, Oxford), vol. 1). 

Peptide samples were prepared for LD as for CD. LD data were collected on samples 
spinning in a couette flow cell by integrating the signal for 2 s every nm in the range 210-320 
nm, using a JASCO J-715 spectropolarimeter. After baseline correction, absorbance was 
converted to molar extinction coefficient (1 mol-res" 1 cm' 1 ) by normalizing for the 
concentration of peptide bonds. A linear correction for a sloping baseline was made to the 
data from the 100 |iM SAP peptide mixture. 

The results are depicted in Fig. 3D, which shows linear dichroism (LD) spectra for: 20 \iM 

tropomyosin ( ), the SAF peptide mixture at 10 |iM ( — ), and the SAF peptide mixture at 

100 jiM in the absence (♦) and presence (O) of 0.5 M KF. 

For instance, we found that tropomyosin, which forms a dimeric coiled coil approximately 42 
nm in length, could be aligned to give a LD signal (Fig. 3D). In contrast and consistent with 
our design and the CD data, experiments with a 1 0 |iM SAF mixture, (Fig. 3D), and for the 
individual peptides at 100 |iM (data not shown), LD signals were not detected. However, a 
matured 100 |iM SAF peptide mixture gave a strong absorbance from the peptide backbone 
(210-240 nm) and some signal in the aromatic region (260-290 nm) during flow orientation 
(Fig. 3D). As only long structures are aligned by this technique, the data demonstrated that 
long fibres at least 500 nm in length were present in solutions of the matured 100 jiM SAF 
peptide mixtures. 

5) Electron Microscopy 

To confirm fibre assembly, we used electron microscopy to visualize structures in the peptide 
preparations directly. For TEM experiments, peptide samples were incubated for 1 h at 5 °C 
in filtered 10 mM MOPS, pH 7. A drop of peptide solution was applied to a carbon-coated 
copper specimen grid (Agar Scientific Ltd, Stansted, UK), and dried with filter paper before 
negative staining with 0.5% aqueous uranyl acetate and then dried at 5 °C. A "fresh" SAF 
peptide mixture was prepared by mixing preincubated solutions of the individual peptides at 
200 jiM directly on the specimen grid, before drying and negative staining as described. 
Grids were examined in a Hitachi 7100 TEM at 100 kV and digital images were acquired 
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with a (800 x 1200 pixel) charge-coupled device camera (Digital Pixel Co. Ltd., Brighton, 
UK) and analyzed (Kinetic Imaging Ltd., Liverpool, UK). 

For scanning electron microscopy (SEM) experiments, negatively-stained specimen grids 
were sputter-coated with gold and examined in a Leo Stereoscan 420 SEM at 20 kV and with 
a probe current of 10 pA. 

No structures were visible up to 100 000 times magnification by transmission electron 
microscopy (TEM) for either the 10 jiM SAF mixture, or for the individual peptides at 100 
(iM concentration (data not shown). However, TEM of a 100 ^iM SAF mixture at 50 000 
times magnification revealed time-dependent formation of long fibrous structures, consistent 
with the CD and LD data. Fresh mixtures showed large numbers of extended fibres of 
various widths. The majority of these had a diameter of about 20 nm (Figs. 4A (a fresh 
mixture at 100 jiM) & Fig 4D); finer fibres were present, but their widths could not be 
measured reliably. Images recorded for the matured mixtures showed fewer fibres, but these 
were more distinct and thicker than those observed in the fresh mixture (Fig. 4B&D). 

Scanning electron microscopy (SEM) of a matured mixture showed no evidence for fibre 
branching. Rather, the fibres were simply intertwined as if layered on top of each other (Fig. 
4C). It was not possible to follow the full length of fibres due to intertwining, but they were 
at least several hundred microns in length. Although the density of fibres varied across the 
surface of the EM grid, for the matured samples at least, their diameters were quite uniform 
with a mean width of 43.3 (SD = 9.3) nm (Fig. 4D). As the original design was for a 
longitudinally extended, but otherwise two-stranded coiled coil the average diameter that we 
might have expected was about 2 nm. Therefore, the EM data suggested that the designed 
two-stranded coiled-coil fibres associate laterally into higher order assemblies. 

6) X-ray Fibre Diffraction 

Mixtures of SAF peptides at 500 (J.M in 10 mM MOPS, pH 7, were incubated on ice for at 
least Ih, before centrifugation at 6500g for 5 min. Droplets of fibre-containing solutions, 
taken from the bottom of the centrifuged tubes, were suspended between the ends of two 
wax-filled capillaries and allowed to dry slowly overnight at 4°C, yielding clumps of partially 
aligned fibres. X-ray fibre diffraction images were collected using a Rigaku CuKa rotating 
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anode source (wavelength 1.5418 A) and a R-AXIS IV detector. Samples were maintained at 



fibre diffraction pattern collected from SAF peptide fibres showed the following features 
(Fig. 6): (1) a short meridional (that is, parallel to the long fibre axis) reflection at 5.1 1± 0.03 
A; (2) the harmonic of this 5.11 reflection at 10.19 ± 0.05 A; and (3) a stronger, more diffuse 
reflection centered at 8.8 ± 0.15 A on the equator. These features are consistent with 
a-helical coiled-coils aligned with the fibre axis. The 5.1 A meridional reflection 
corresponds to the pitch of the helices within the coiled-coils. The other expected reflection 
on the meridian-that is, that at 1.5 A and corresponding to the rise per residue-lies out of the 
resolution of the current data sets, whereas the equatorial reflection reveals the mean distance 
between cx-helical axes. This value at 8.8 A is less than the observed value for keratin but 
falls within reported ranges for dimeric coiled-coil peptides. 

7) Effect of Potassium Fluoride on Protein Fibre Assembly 

Molecular modeling of the SAF sequences into an extended two-stranded coiled coil also 
highlighted potential complementary charge interactions on the surface of the protofibrils, 
Figs 1&2. In accordance with this, experimentally it was found that moderate concentrations 
of salt inhibited protofibril and thick fibre assembly. First, CD spectra recorded for both the 
individual peptides and a 100 \xM mixture of SAF peptide samples with 0.5 M potassium 
fluoride showed reduced helical CD signals and there was no evidence of "maturing" in the 
mixed samples (Fig. 3C). Second, the LD signal described previously for the matured 100 
\xM SAF peptide mixture was also lost when the experiment was repeated in the presence of 
salt (Fig. 3D). Finally, TEM images of a 100 jiM SAF mixture also demonstrated that fibres 
were not formed in the presence of 0.5 M KF (Fig. 4E). Fig. 4E shows the results of TEM of 
amatured SAF peptide mixture at 100 jiM incubated in the presence of 0.5 M KF. 
The inventors did not knowingly design any features into the SAF peptides to foster further 
association of the two-stranded coiled coils. The observation of thick fibres in SAF peptide 
preparations, therefore, raised the question: what interactions guided and stabilized these 
higher-order assemblies? The inventors therefore propose that features inherent in repeating 
structures of the type that they designed will naturally promote such fibre assembly 
(fibrillogenesis). 



5°C during data collection with cool air from a cryostream (Oxford Cryo-systems). The X-ray 
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8) Coiled-coils design 
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anRula: 



coiled-coil these angular offsets are altered to 103°, 154° and 51°, respectively. However, 
both sets of angles are over-simplifications when considering helix-helix interactions in 
actual coiled-coil systems because side-chain size, geometry and packing also affect the 
helix interfaces (Harbury, P. B. et al (1993) Science 262, 1401-1407; Harbury, P. B. et al 
(1994) Nature 371, 80-83; Malashkevich, V. N. et al (1996) Science 274, 761-765). 
Nonetheless, we found that many natural coiled-coil assemblies, at least, were consistent 
with the approximate angular offsets: Trimers could be considered as having overlapping 
heptads separated by 3 residues (angular offset = 51/60°). Whereas, tetrameric and 
pentameric coiled-coils were often variations on a theme with two hepad repeats offset by 
1 residue (100/103°). 

b. Two heptad repeats offset by two residues: cc-cylinder constructions 

Sequence offsets of 2 residues are potentially more interesting than the 1- and 3-residue 
offsets. This is because of the possibility of placing hydrophobic (H) residues at a, c, d, 
and f, with c and f effectively making up the a 1 and d* positions of the second, offset 
heptad. This is represented below, where P signifies polar (non-core) residues. 



abcdefgabcdefg repeat 1 

HPPHPPPHPPHPPP binary pattern 1 

PPHPPHPPPHPPHP binary pattern 2 

f'g'a'b'c'd'e'f'g'a'b'c'd'e' repeat 2 

abcdefgabcdefg assigned register 

HPHHPHPHPHHPHP overall binary pattern 



Such sequence patterns would results in two hydrophobic seams with a wide angular 
separation (154/160°), which would place them roughly on opposite sides of the helix. 
Furthermore, it offers two possibilities for parallel helix-helix packing arrangements: syn y 
where two like faces - i.e a / d with a / d , or c/f with c/f - from neighbouring helices 
combine to produce an opena-sheet, Fig. 6a; anti, where a/d faces pair with c/f. In the 
anti-arrangement the structure can close to form a a-cylinder. For antiparallel pairs of 
helices syn-typic association should lead to cylinders, whereas sheets should be formed 
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from anti-typic antiparallel interfaces. 

c. A natural a-cylinder 

TolC has two a-barrel-like domains (Koronakis, V. et al (2000) Nature 405, 914-919). 
Both have 12 helices contributed by 3 monomers. In the lower barrel each helix pairs 
with another from the same protomer to form separate supercoiled, antiparallel 
coiled-coils; SOCKET analysis revealed extensive antiparallel knobs into holes (KIH) 
interactions within these pairs, but not between them. In contrast, the helices of the upper 
barrel appear to pack more uniformly, albeit with a slant, to describe an a-cylinder. The 
SOCKET output for this part of the structure revealed many fewer KIH interactions than 
found in the lower barrel. Furthermore, KIH interactions were not contiguous around the 
cylinder and, in particular, they were more extensive between helices in the same 
monomer, but less regular between the helices abutting the monomers. In our view, the 
TolC barrel represents a variation of the cylinders formed by protein structures of the 
invention. 

Nevertheless, the inventors were able to assign heptad registers for the helices of the 
upper barrel unambiguously. This revealed knobs at relative a, c, d, and /positions and 
syn-typic association of two seams adjacent helices; i.e fully consistent with the theory 
outlined above. 

We believe that it will be possible to constuct a-sheets and a-cylinders using helices in 
parallel. The use of parallel helices does have one interesting consequence for the 
construction of a-cylinders, however: as the pairing in these structures will be anti-typic, 
a residues on one helix partner c residues of a neighbouring helix at the same level in the 
structure. Similarly, d and / residues pair at the intervening levels. The result will be that 
successive helices will be translated up the helix and cylinder axes by two residues, which 
is equivalent to «3A. Thus, attempts to construct a-cylinders from parallel helices will 
give spirals of helices which may or may not close. This is, however, potentially 
extremely interesting as it opens up possibilities for making peptide-based nanotubes as 
described above. 
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A second consideration for a-cylinder construction is the consequences of helix and 
coiled-coil supercoiling. The upper barrel of TolC has 12 helices. Based on a structure of 
parallel helices with canonical supercoiling, i.e an angular separation of 154° between the 
two seams in each helix, we calculated that the cylinder should close at 14 helices. 
However, variations in helix number are expected. One reason for this is that helices 
cannot supercoil in two direction simultaneously, and some distortion is required to 
maintain packing at both interfaces. We found structural precedents for this in the Protein 
Data Book PDB where tight knobs-into-holes packing was maintained (Walshaw & 
Woolfson, unpublished); indeed, the central helices of the 3 -helix a-sheets are straight, 
Fig. 7b. (n.b . The slanting of the helices in the upper barrel of TolC may offer a 
compromise between straight and supercoiled helices). Assuming the packing of 
completely straight helices, the angular offset becomes 160° and 18 helices would close a 
cylinder. However, given that, as in 3-, 4- and 5 -stranded coiled coils, side chains 
mediate the helix-helix contact angles other oligonmerisation states might be possible 
(Harbury, P. B et al (1993) Science 262, 1401-1407; Harbury, P. B. et al (1994) 371, 
80-83; Malashkevich, V. N. et al (1996) Science 274, 761-765): we calculate that small 
adjustments in the angular offset between 144° to 162° varies the helix number from 10 to 
20. 

9) Formation of Protein Structures 

As mentioned above, the protein structures of the invention may have various applications 
such as in: 

Nanotubes 

a. This can be achieved for example by combining the aforementioned 7- and 11 -residue 
repeats with offsets in the sequence. The effect would be eliminate the overall 
hydrophobic displacement. In other words, alternating heptad and hendecad repeats give 
an 18-residue repeat to match the a-helical repeat; in the ot-helix, 18 residues span 5 
helical turns exactly. It may therefore be possible to create a completely closed peptide 
nantotube (Fig. 5 shows part of a nanotube) In the parallel, straight helix case there would 
be 18 helices per turn of the "cylinder", and the rise per turn is 36 residues. Thus, a 
36-residue peptide with a 7-11-7-11 repeat offset by 2 residues should form a spiral of 
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helices the ends of which meet to close the tube. Such nanotubes maybe particularly 
useful in the production of nanoscale piping and plumbing. The interior of the tube may 
be derivatised to control the flow of different small (2-40A) molecules. 



b. Derivitised and branched peptides and peptide templates 

The self-assembling peptides of the invention are relatively small and synthetically 
accessible. Thus, non-standard derivatisable side chains may be incorporated in them. 
For example, the monomer units can be made with a single cysteine residue at an exterior 
f position. These can be used to couple small molecules and other peptides using 
thiol-based chemistry. A wide variety of thiol-reactive probes are available. In particular, 
the peptides can be tagged with fluorophores. For instance, with one peptide labelled 
with Fluorescein and the other with Rhodamine fibres visualised by confocal microscopy 
appear green and red, respectively (Fig. 7). There is a possibility for FRET between the 
probes, which may pack closely in the fibres, and this may confuse interpretation. To 
avoid this the tagged peptides can be doped into fresh, assembling SAF mixtures. Having 
available fluorescently labelled peptides and fibres offers another route to tracking 
fibre/network assembly and orientation. 



To generate branched self-assembling fibres "T-shaped" conjugated peptides can be 
made. These are covalent heterodimers made by mixing and coupling together variants of 
two SAF peptides: one with a terminal cysteine and the other having a central cysteine 
residue. The desired products can be purified from the mix of disulphide-linked peptide 
by PHLC. Doping the conjugated ("T") peptides into fresh SAF mixtures should 
propagate fibre assembly in three dimensions as both the "bar" and the "stem" of the "T" 
could become incorporated in, or initiate, fibres. The resulting networks can be visualised 
and characterised by EM. 



Peptide synthetic diblock copolymer hybrids may be produced. Suitable methods for 
preparing water soluble diblock copolymers using atom transfer radical polymerisation 
are described in X. S. Wang et al Chemical Communications 1817 (1999) and X. S Wang 
et al Macromolecules 33, 257 (2000). 



The protein fibres of the invention may be used to template and control this 
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polymerisation eiti^Wo produce hybrid fibres or if the peptia^emplate is subsequently 
disassembled and marked away, to provide routes to water soluble "fishnet" nanotubes. 
Other possibilities include: conjugating polymers onto preassembled peptide fibres; 
conjugating the polymers and peptides prior to fibre assembly; and effecting 
polymerisation on the pre-assembled fibres. 

c. Formation of Matrices 

The protein fibres of the invention may be arranged to form two and three dimensional 
grids and matrices respectively. One application for such matices is in the purification of 
biological fluids such as blood. An affinity matrix could be assembled (for example in 
situ in blood) to remove blood contaminants such as viruses. In the case of virus removal, 
a binder for the target contaminant (e.g a peptide or protein with natural or engineered 
affinities for a viral coat protein) can be fused to a peptide monomer units in the-protein 
structure of the invention. The matrix can then be removed from blood along with any 
bound contaminants by light centrifugation. For example, it is estimated that a 100 nm 
length of fibre would have a mass of > 12 MDa which would readily be removed. Such 
affinity matrices have a number of advantages over larger naturally occurring proteins. In 
the assembled matrices any binders are aligned to give high effective avidities for the 
targeted molecules. 

d. Other applications 

Other applications for protein structures in accordance with the invention include: 

i. preparation of organised networks for seeding the crystalisation of biomolecules for 
X-ray crystallography; 

ii. using ordered fibres to promote cell growth for tissue engineering; 

iii. the construction of nanoscale molecular sieves 

iv. the preparation of nanoscale molecular grids/scaffolds that could be used as supports 
for a variety of functional small or macromolecules. 

v. fimctionalised grids and networks could be used in, for example, catalysis, 
affinity-sieving/purification of biological fluids and other research solutions, the 
recruitment of endogenous molecules and co-factors to promote tissue repair and 
tissue engineering in general. 
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vi. to create novel lai 




-chip technologies, peptide self-assei 




could be combined 



with lithography as follows. 

Lithography and related techniques can be used to pattern a variety of surfaces with 
channels, which can be made of a suitable size (e.g 20-100 nm wide and deep) to 
accommodate peptide fibres. These can then be used to direct the assembly of the 
fibres from solutions mixed directly on the surfaces. Furthermore, using 
well-established chemistry, the inventors envisage funtionalising the peptide fibres 
with a variety of small molecules and other proteins. This proposed combination of 
peptide design, self-assembly and lithography should allow the development of 
ordered arrays of functional polymers on specific surfaces, 
vii. Assembled fibres could also be used as fine (therefore, high resolution) tips in AFM 
(atomic force microscopy) the current limit is about 10-25 nm using carbon 
nanotubes. 
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Claims 



1 . A protein structure comprising a plurality of first peptide monomer units arranged in a 
first strand and a plurality of second peptide monomer units arranged in a second strand 
in which a first peptide monomer unit in the first strand extends beyond a 
corresponding second peptide monomer unit in the second strand in the direction of the 
strands. 

2. A protein structure according to claim 1 in which the strands together form a coiled coil 
structure. 

3. A protein structure according to claim 1 or 2 in which at least one charged amino acid 
residue of a first peptide monomer unit is arranged to attract an oppositely-charged 
amino acid residue of a second peptide monomer unit. 

4. A protein structure according to claim 3 in which the charged amino acid residue is in 
an end portion of the first peptide monomer unit which extends beyond the 
corresponding second peptide monomer unit in the second strand. 

5. A polypeptide structure according to any preceding claim in which at least one strand 
consists solely of first or second peptide monomer units respectively. 

6. A protein structure according to any preceding claim in which the peptide monomer 
units comprise a repeating structural unit. 

7. A protein structure according to claim 6 in which the repeating structural unit 
comprises a heptad repeat motif (abcdefg). 

8. A protein structure according to claim 6 in which the repeating structural unit 
comprises a hendecad repeat motif (abcdefghijk) 
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9. A protein structure according to claim 6 having isoleucine or asparagine at position a 
and leucine at position d. 

10. A protein structure according to claim 6 having valine or leucine at positions a and d 
respectively. 



1 1 . A protein structure according to any one of claims 7 to 1 0 having oppositely-charged 
otherwise complementary residues at positions g and e of respective monomer units. 



or 



are 



12. A protein structure according to claim 1 1 in which the oppositely-charged residues 
glutamic acid and lysine residues or asparagine and aspartic acid residues, or synthetic 
derivatives of these amino acid residues. 

13. A protein structure according to any preceding claim in which the structure is stabilised 
by pairs of asparagine, arginine, lysine or other complementary residues provided by 
corresponding first and second peptide monomer units. 

14. A protein structure according to any preceding claim which is arranged to form a 
tubular structure. 

15. A protein structure according to claim 14 in which the peptide monomer units are offset 
by two or more amino acid positions in sequence whereby the peptide monomer units 
form a cylinder. 

16. A protein structure according to any preceding claim in which the first and second 
peptide monomer units have the sequence: 

a) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-pl) and 

b) KI RALKAKNAHLKQE I AALEQE I AALEQ (SAF-p2) respectively; or 

c) KIAALKQKIAALKQEIDALEYENDALEQ (SAF-plA) and 
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d) KIRALKWKNAHLKQEIAALEQEIAALEQ ( SAF-p2C) respectively; or 

e) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-plC) and 

f) KIRALKWKNAHLKQEIAALEQEIAALEQ (SAF-p2C) respectively. 

17. A peptide monomer unit for use in preparing a protein structure the peptide monomer 
unit having an amino acid sequence selected from: 

a) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-pl); 

b) KIRALKAKNAHLKQEIAALEQEIAALEQ (SAF-p2); 

c) KIAALKQKIAALKQEIDALEYENDALEQ (SAF-pl A); 

d) KIRALKWKNAHLKQEIAALEQEIAALEQ ( SAF-p2C); 

e) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-plC) ; and 
d) KIRALKWKNAHLKQEIAALEQEIAALEQ (SAF-p2C). 

18. A protein structure or peptide monomer unit according to any preceding claim in which 
at least one amino acid residue is derivatised. 

19. A method of producing protein structures, the method comprising providing a mixture 
of first and second peptide monomer units which associate to form a protein structure 
according to any preceding claim. 

20. A method according to claim 19 in which the protein structure is derivatised. 

21. A method according to claim 19 or 29 in which the protein structure is stabilised by 
cross-linking. 
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22. Protein fibres produced by an association of protein structures according to any one of 
claims 1 to 3 or a method according to claim 19, 20 or 21. 

23. A kit for making protein structures, the kit compnsmg first and second peptide 
monomer units winch associate to form a protein structure accordmg to any one of 
claims 1 to 13 or protein fibres according to claim 22. 

24. A two dimensional matrix comprising a protein structure according to any one of 
claims 1 to 13 or protein fibres according to claim 22. 

25. A three dimensional grid comprising a protein structure according to any one of claims 
1 to 13 or protein fibres according to claim 21. 

26. A matrix according to claim 25 which is arranged to assemble in solution. 

27. A matrix according to claim 25 or 26 which is arranged to bind a target entity. 

28. A matrix according to claim 27 which is arranged to bind viruses. 

29. A method of forming a matrix according to any one of claims 25 to 28 in which a 
mixture of separate first and second monomer units is provided and are then caused to 
associate to form a protein structure in accordance with the invention, an accumluation 
of such protein structures assembling in turn to form a three dimensional matrix 



30. A method according to claim 29 in which the matrix is formed 



in situ. 



31 . A method for controlling the production of a synthetic polymers comprising assembling 
a pro.cn structure in accordance to any one of claims 1 ,o 16 in association with the 
polymer. 
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32. A method according to claim 31 in which the protein structure is removed after 
synthesis of the polymer. 

33. A tip for use in Atomic Force Microscopy comprising a protein structure according to 
any one of claims 1 to 16. 
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Peptides 

This invention r e,a,es „ protein fibre formalion and ^ ^ ^ ^ 

protetn fibres ,o fomr a protein structure comprising a p,ura lit y of firs, poiypepttde untts 
arranged iu a firs, polypeptide straud and a pharauty of second po.ypepfide untts arranged in 
a second poiypeptide strand, the strands preferabiy fencing a coiled eoil structure, and in 
whteh a firs, po,ypep, ld e unit in the fi rst SIrand extends beyo „ d , COTIe 
polypepttde unit in the second strand in the direction of the strands. 



SUBSTITUTE SHEET (RULE 26) 



WQ 01/21646 



7/70 



PCT/GB00/03576 




WO 01/21646 PCT/GB00/03576 



2/10 



00 
CM 



co 




^ w w w 



ro vo 

CM CM rH 

H M S H 



vo o% cm 

CM • H rH LD 

hi hi hi 



cm in 

CM rH CO rH 

H W « « 



in co h 

CM fH 1-4 

<J < ffl <c 



rH 
CM 

& 

CO 
CM 

o 



ID 

CM rH 

<: <u w 



in cm 

rH CM rH CO 

&3 W « « 



CM 




CM in rH 

hi hi hi 



VO CO 
CM C\ rH CM 

H !3 H H 



ro o 

VO rH CM CM 

« « w w 



o r*- ^ 

CO rH rH CM 

ffj <: 



SUBSTITUTE SHEET (RULE 26) 



PCT/GB00/03576 



4/10 




l i I i i " — i — ' — i — i — r 




(e-0 1- * L_S8J-|oiup z mo 69p) 
Ai!0!ld!||3 jb|0|A| 



SUBSTITUTE SHEET (RULE 26) 



WOd/21646 



5/10 



PCT/GB00/03576 



en 
CD 




CSJ 



cvj -^r co oo o 



CM 



UOIJOinptt JB|OLU |B!}U9J8^!Q 




CD lO CD lO O UO 

"V T ■ ^ 

( s -0l x t _saj-|owp z iuo Bap) 
AiioiidniB JB|0|A| 

SUBSTITUTE SHEET (RULE 26) 



INSDOCID: <WC_0121 646A1 J_> 




SUBSTITUTE SHEET (RULE 26) 



WO ,01/21 646 



7/10 



PCT/GBOO/03576 




CD 

















r * 














?" 








■"• i-fyf .y-" 




- • 






• 
































% : 
















1 ■ 


























» - 


































v ■■ *' 
















* f 




t 








/ 
















if' 
,* 
















.i 
















'-• >,- 




/" 








































. ... 








i 




































1 



SUBSTITUTE SHEET (RULE 26) 



iNSDOCID: -WO 0121C4GA1_I_^ 



WO 01/21646 



PCT/GBOO/03576 



8/10 



0g 



V777, 



VZZZZZZZ 



WZZZZZZZZZ 



wzzzzzzzzzz 



r 



T 



CD 
CO 



LO 



LO +i 



CD 



LO 
CO 



CD 
CO 



LO 
C\J 



CD 
CM 



T 1 1 1 1 

CD CD CD CD CD CD CD CD CD CD 

en co f — co m co c\j r- 




Q3 








-Q 
| 


+ 




1 




D 


** 


+ 



en 



SUBSTITUTE SHEET (RULE 26) 




GB00/03576 



SUBSTITUTE SHEET (RULE 26) 

3NGDOC1D: <WO 0121048A1J_^ 



WO 01/21646 



10/10 



PCT/GB00/03576 

• 




SUBSTITUTE SHEET (RULE 26) 



INTERNASCNAL SEARCH REPORT 



Application No 

00/03576 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 7 C07K14/00 C07K1/113 G11B9/00 



According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 7 C07K 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical search terms used) 

EPO-Internal, BIOSIS, WPI Data, PAJ, CHEM ABS Data, MEDLINE 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category • Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



US 5 712 366 A (KAPLAN DAVID L ET AL) 
27 January 1998 (1998-01-27) 
column 6, line 35-57; figures 6A.6B 

WO 96 11947 A (GOLDBERG EDWARD B) 
25 April 1996 (1996-04-25) 



page 7-9; figures 3,4 
page 25-26 



1-15, 
18-30 



1-6,14, 
15, 

18-20, 
22-30 



-/- 



LH 



Further documents are listed in the continuation of box C. 



ID 



Patent family members are listed in annex. 



° Special categories of cited documents : 

*A" document defining the general state of the art which is not 
considered to be of particular relevance 

"E" earlier document but published on or after the international 
filing date 

"L" document which may throw doubts on priority claim(s) or 
which rs cited to establish the publication date of another 
citation or other special reason (as specified) 

•O* document referring to an oral disclosure, use, exhibition or 
other means 

•P* document published prior to the international filing date but 
later than the priority date claimed 



*T later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X* document of particular relevance: the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y* document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

document member of the same patent family 



Date of the actual completion of the international search 



8 January 2001 



Date of mailing of the international search report 

17/01/2001 



Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patent laan 2 
NL-2280HVRIjswijk 
Tel. (+31-70) 340-2040. Tx. 31 651 epo n». 
Fax (+31-70) 34O-3016 



Authorized officer 



Cervigni, S 



Fofm PCT/1SA/210 (second shoot) (Juty 1992) 
NSDOCID: <WO 0121646A1_I_> 



page 1 of 2 



INTERNATIOJ 



SEARCH REPORT 



C(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 



Citation of document, with indication. where appropriate, of the relevant 



passages 



KOJIMA SHUICHI ET AL: "FIBRIL FORMATION 
BY AN AMPHIPATHIC ALPHA-HELIX-FORMING 
POLYPEPTIDE PRODUCED BY GENE ENGINEERING " 
PROCEEDINGS OF THE JAPAN ACADEMY SERIES B 
PHYSICAL AND BIOLOGICAL 
vol. 73, no. l, 1997, pages 7-11, 
XP000971780 
1997 

ISSN: 0386-2208 
abstract 

r D L ? ll KA ET AL: "REVERSIBLE HYDROGELS 
FROM SELF-ASSEMBLING ARTIFICIAL PROTEINS" 
SCIENCE,AAAS. LANCASTER, PA, US, ™ U ' tlNS 
vol. 281, 17 June 1998 (1998-06-17) naoes 
389-392, XP002149253 P 9 

abstract; figure 2 

KOHN WD ET AL: "De novo design of 
alpha-helical coiled coils and bundles- 
models for the development of 
protein-design principles" 

TRENDS IN BIOTECHNOLOGY, NL, ELSEVIER 
AMSTERDAM, * 
vol. 16, no. 9, September 1998 (1998-091 
pages 379-389, XP004173181 
ISSN: 0167-7799 



Relevant to claim No. 



1-15,19, 
22-30 



Form PCT/ISA/210 (continuation ot second shoot) (July 1992) 



1-13, 

18-20, 

23-30 



page 2 of 2 



INTERNATIONAL SEARCH REPORT 



Ian on patent family members 



PCT^re i 



Application No 

PQlJW 00/03576 



Patent document 




Publication 


Patent family 


Publication 


cited in search report 




date 


member(s) 


date 


Uj 0/1 c jOO 


A 


97— n 1 —1 QQQ 


NONE 






LIH Qf\1 1 QA7 


A 
M 




us 


5877279 A 


02-03-1999 








AU 


689662 B 


02-04-1998 








A 1 1 

AU 


3829695 A 


06-05-1996 








BR 


9509487 A 


OA AA i /"\ /-\ -7 

30-09-1997 








CA 


2202474 A 


25-04-1996 








CN 


1168676 A 


24-12-1997 








EP 


0785946 A 


30-07-1997 








HU 


77683 A 


28-07-1998 








JP 


10508194 T 


18-08-1998 








NZ 


295046 A 


25-03-1998 








US 


5864013 A 


26-01-1999 



Form PCT /ISA/210 {patem tamity annex) (Jury 1992) 

siSDOc.in- <wo m?ifi4fiAij_? 



THIS PAGE BLANK (USPTO) 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 
REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 




THIS PAGE BLANK (usfto) 



